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r 1: XM_171629. Homo sapiens simi...[gi:2206223 1 1 

LOCUS L0C257238 1295 bp rnRNA linear PRI 01-AUG-2002 

DEFINITION Homo sapiens similar to cortical granule serine protease 1 
precursor (LOC257238 ) , rnRNA. 



VERSION 
REWORDS 
SOURCE 



Aj-i_ i / iuz: 

XM 171629. 1 01:22062231 



Hono sapiens (human) 
ORGANISM Hono sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Suteleostomi ; 
Mammalia; Euthena; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 12 95) 

AUTHORS NCBI Annotation Project. 
TITLE Direct Submission 

JOURNAL Submitted ( 31 - JUL -2 0 02 ) National Center for Biotechnology 
Information, NIH, Bethesda, MD 2 08 94, USA 
COMMENT GENOME ANNOTATION REFSEQ: This model reference sequence was 

predicted from NCBI contig NT._0Q9782 by automated computational 
analysis using cone prediction method: GenomeScan, supported by EST 
evidence . 
Also see: 

Documentation of NCBI ' s Annotation Process 



FEATURES 

source 



Locat 10:1/ Qua 1 i f lers 
1 . . 1295 

' jrcan i s::; " Homo sapi ens " 
-b :•; : . ■ : - ":axo:; : 96C6" 



, ::b xr n ; 11 I nterimlD : 2 :>7 2.:>t> " 

1 68 . . 1295 
/ ger.e="LOC2 57 2:-8 " 
/ :odon_s:art=l 
.■ product simi 1 a r to cort ;c 
p r o c u r s o r " 
>->>- n ♦■ p ; "id--"/': 1 ] ' ' ] ^ 2. () . . " 



granule set': no protease 



\rjiw \\ w .ncbi .nlm.nih.uo\/cntrc/Ajucr\ .k^i 'cmJ=RcUic\eLVdh=tuii McolRlcA:hsl_uids-2 ... 1 2/4/2002 



note- " Region : smart 00 02 0, Tryp_SPc, 'Trypsin- like serine 
protease; Many of these are synthesisecl as inactive 
i.irecursor zymogens that are cleaved during limited 
proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others 
are inactive due to substitutions of the catalytic triad 
residues " 
misc_f eature 171. .4 34 

gene = " TOC2 57 2 3 3 " 

note= " Region : pf am00089 , trypsin , Trypsin " 
misc.__f eature 819. . 1199 

gene = " hOC2 57 u 3 3 " 
, note- " Region : pfam02395, EGA! , Immunoglobulin Al 
protease. This family consists of immur.o j 1 obul in Al 
protease proteins. The immunoglobulin Al protease cleaves 
immunoglobulin IgA and is found in pathogenic bacteria 
such as Neisseria nnnnrrhnp,^e Not all ■ j f the members of 
this family are IgA proteases (one member from E. coli 
cleaves human coagulation factor V, another one is a 
hemoglobin protease) " 
mi sc_f eature 900 .. 1187 

gene -"LOCI": 57 2 3 8 " 

note-"Region : smart00020, Tryp_SPc, Trypsin -like serine 
protease; Many of these are synthesised as inactive 
precursor ::ymogens that are cleaved during limited 
proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and others 
are inactive due to substitutions of the catalytic triad 
res i dues " 
misc_f eature 90 3. .1137 

gene - " LOC2 57 .': 3 8 " 

note - "Region : pfamO0OS9, trypsin, Trypsin" 
variation h81 

gene = " LOCI: 5 72 3 3" 
allele-" C " 
allele-" T " 
. db_xr e f - " dbSM P : 3 7 4 2 0 7 1 " 
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RemevecVdb=iuUcotidcc'disl_ md>=2... 12/4/201): 



1261 ctttatgttt tgtcatctta ctagcaacaa cataa 



Revised: July 5, 2002. 
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>XM_171629 AGCESSION:XM_171629 NID: gi 2206223 1 ref XM_171629.1 Horn-:. 

sapiens similar to cortical granule serine protease 1 
precursor { LOG 2 : j 7 2 3 8 ) , mRJJA 
Length = -2 95 



Identities = 141/153 (922), Positives = 141 153 (92%) 
Frame - +3 

Query : 6 2 GG r A^P YKEY'YQGSR 1 1 GGTEAQAGAV; PVjVVSLQ) IKYGFVLVHVCGGTLYRERhVLTAAHC 12 1 

<:G7APLKDVLQGSRIIGG7EAQAGAVJPl^/SLvIKYGRVLVHVCGGTLVRE 

Sbjct : 3 CGTAPLKE'VLQGSRI IGGTEAQAGAWPWW^^^ 15 5 

ijuery : 122 T K D S D ? L MWT AV I G TNT J I H G P. Y P H T K K I ?' 1 YA 1 1 1 H PI IE I LES YYNDI ALFHbKKAVRYI J 181 

SDPLMVJTAVIGTrrjrHGP.YPHTKKIKIHAT I IHPNFILESYWDIALFHLKKAVRYN 
Sbjct : 156 .sUPl^lW'rAYIGTrr.JI}IGRYPHTP;P:i?:iFAi irHPIJFILESYVNDIALFHLKKAVRYIJ 326 

liuery: 182 DYIQPICLPFDYFQ ILDGI ITKGFI SGWGRTKEE 214 
D Y 1 Q P I C L 1 1 F D V F < ~ ' 1 L DGUT KG F 1 3 GVJG RT K E E 

Sbjct : 327 dytqpiclpfdvfqtldg:jtkgfisgv;grtkee 427 



Identities - 331/131 (100*;), Positives = 131,131 (100%) 
Frame = +3 

Query : 215 gijatnidjdaevhyisremc:jsersyggiip:jtsfcagdedgafdtcrgdsggplmcylp 274 

GNATrJILODAEVHYISREMCIISERSYGGIIPinVSFCAGDEDGAFDTCRGDSGGPLMCYLP 
Sbjct : 900 GIJATniDjDAEVHYlSRSMCIJSERSYGGIIPriTSFCAGDEDGAFDTCRGDSGGPLMCYLP 1079 

Query: 27 5 EYr'RFFVT^IGITSYGHGCGRRGFPGYYIGPSFYQKWLTEHFFHASTQGILTIIJILR.G'liILI 3 34 

ey?:rff\^igitsyghgi:grrgfpgvyigpsfyqkvjltehffhastqgiltiijilrgqili 

Sbjct : 1080 EY?:RFFWG1TSYGFIGCGRRGFPGYYIGPSFYQKV'JLTEHFFHASTqGILTINILR(3QILI 1259 



Query: 3 35 ALCFVILLATT 34 5 
ALCFVILLATT 



] 



>XM_171629 ACCESSION:XM_171629 NID: gi 22062231 ref XM_171629.1 Homo 
sapiens similar to cortical granule serine protease 1 
precursor ( LOC2 57 2 3 8 ) , mR:;A 
Length - 12 9 5 



Identities = 141/153 (92H, Positives = 141 153 (92%) 
Frame +3 

Ouery : 62 C 1 3 T A P L K D Y L ' J G S R 1 1 G G T E A 1 J A G A W P WW S L < J I K Y G R VL VH VC G G T L Y R. E R WYL T AAH C 121 

CGTAPLKDYLnGSR. I IGGTEAQAGAV/PWYVSLO I KYGRVLVHVCGGTLVRE 
Sb jcr. : 3 CGTAPLKD\ r LOGSRi:"'3GrFAC>A(3AVJPi^v^SLgI KYGRVLVHV(:(3GTLVPE 15 5 

nuery: 122 T K D S D P L MVJT A Y I G H J I H G R V P H T K } ' I : ' I P LA 1 1 1 H ? 1 1 F I L E S T/I J D I A L F H L K KJYY R Y I J 181 

SDPLMVJTAVIGTrnaHGKYPHTJIKIrlirAIIIHPIJFILEST/IJDIALFHLKP^VRYN 
Sbjet : 136 SDPLMl'/TAVIGTrHJIHGRYPHTf'.KIKIKA: I IH PIJFILEST/TIDIALFHLKKAVRYN 32 6 

■juery: 182 DYIQPICLPFOVFf'ILDGIITrY .TISGWGRTKEE 214 

DY'IClPICLPFDVFr-ILDGIJTr'.CFISGVJGRTKEE 
Sbjct: 327 DYIijPICLPFDVFQILDGIJTr'.CFISGVJGRTKEE 427 



Identities - 131/131 (100*.), Positives = 131/131 (100M 
Frame - +3 

Query: 215 GNATIJILODAEVHYISREMt'IJSERSYGGIIPIJTSFGAGDEDGAFDTCRGDSGGPLMCYLP 274 

GNATNILQDAEVHYISREMCIISERSYGGI IPI [TSFCAGDEDGAFDTCRGDSGGPLMCYLP 
Sb jet : 9 00 GNATNIL(jDAEVHYISREMC7JSERSYGGIIPIITSFCAGDEDGAFL)TCRGDSGGPLMCYLP 107 9 

Query: 27 5 EY};RFF\^lGITSYGHGCGRRGFPGVYIGPSFYriKWLTEHFFHASTr.GILTIIJILR(3r)ILI 33 4 

EYKRFFX^lGITSYGHGCGRF.GFPGVYIGPSFYQKWLTEHFFHASTQGILTIinLFlGOILI 
Sbjc*: : 108 0 EYKRFFV^IGITSYGHGGGRF:GFPGVYIGPSFYQKWLTEHFFHASTQGI LTIIJILRGQILI 12 59 

Query: 335 ALCFVILLATT 345 

ALOFVI LLATT 
Sh \c: : 1 260 ALORYTROATT 12 G 2 



